Geographically weighted regression (GWR) is an essential tool for estimating spatially varying relationships between dependent and independent variables in geographical contexts. However, the classical linear regressions that compose the GWR model are prone to underfitting, especially on large volumes of complex nonlinear data, which leads to inferior performance. In contrast, advanced models such as decision trees and support vector machines can learn features from complex data more effectively, but they cannot provide explainable quantification of the spatial variation of localized relationships. To address these issues, we propose GWRBoost, a geographically gradient-boosting weighted regression model that applies a localized additive model and gradient boosting optimization to alleviate underfitting while retaining explainable quantification of spatially varying relationships between geographically located variables. Furthermore, we formulate the computation of the corrected Akaike information criterion (AICc) for the proposed model to enable comparative analysis with the classical GWR algorithm. Simulation experiments and an empirical case study demonstrate the performance and practical value of GWRBoost: our model reduces RMSE in parameter estimation by 18.3% and AICc in goodness of fit by 67.3%.
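To make the baseline concrete, the following is a minimal sketch of the classical GWR estimator that GWRBoost builds on: a weighted least-squares fit at each location, with weights from a spatial kernel. The Gaussian kernel choice and all names are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def gwr_coefficients(X, y, coords, bandwidth):
    """Classical GWR sketch: solve a weighted least-squares problem at each
    location i, weighting observations by spatial proximity to site i.
    X: (n, p) design matrix (include a ones column for the intercept),
    y: (n,) response, coords: (n, 2) locations (u_i, v_i)."""
    n, p = X.shape
    betas = np.empty((n, p))
    for i in range(n):
        d = np.linalg.norm(coords - coords[i], axis=1)  # distances to site i
        w = np.exp(-0.5 * (d / bandwidth) ** 2)         # Gaussian kernel weights
        XtW = X.T * w                                   # X^T W without forming W
        betas[i] = np.linalg.solve(XtW @ X, XtW @ y)    # (X^T W X)^{-1} X^T W y
    return betas                                        # beta(u_i, v_i) per site
```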
Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.
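Since the checkpoints are public, they can be loaded through the Hugging Face transformers library. The snippet below is an illustrative usage sketch: it loads a smaller released variant (bigscience/bloom-560m), since the full 176B checkpoint requires multi-GPU hardware.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Smaller released variant; swap in "bigscience/bloom" if you have the hardware.
tokenizer = AutoTokenizer.from_pretrained("bigscience/bloom-560m")
model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")

inputs = tokenizer("BLOOM is an open-access language model that", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```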
Video super-resolution is one of the most popular tasks on mobile devices, widely used for automatic enhancement of low-bitrate and low-resolution video streams. While numerous solutions have been proposed for this problem, they are usually quite computationally demanding, demonstrating low FPS rates and poor power efficiency on mobile devices. In this Mobile AI challenge, we address this problem and task the participants with designing an end-to-end real-time video super-resolution solution for mobile NPUs optimized for low energy consumption. The participants were provided with the REDS training dataset containing video sequences for a 4X video upscaling task. The runtime and power efficiency of all models were evaluated on the MediaTek Dimensity 9000 platform with a dedicated AI processing unit capable of accelerating floating-point and quantized neural networks. All proposed solutions are fully compatible with the above NPU, demonstrating frame rates of up to 500 FPS at a power consumption of 0.2 [Watt / 30 FPS]. A detailed description of all models developed in the challenge is provided in this paper.
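As an illustration of the kind of NPU-friendly design such solutions tend to use (not any participant's model), here is a minimal PyTorch sketch: a few cheap convolutions followed by a depth-to-space (pixel-shuffle) layer for the 4X upscaling, a pattern that quantizes well. Layer sizes are arbitrary assumptions.

```python
import torch
import torch.nn as nn

class TinySR(nn.Module):
    """Illustrative 4x super-resolution net: cheap convolutions plus
    PixelShuffle, a pattern that maps well to quantized mobile NPUs."""
    def __init__(self, channels=16, scale=4):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(3, channels, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(channels, 3 * scale ** 2, 3, padding=1),
        )
        self.upsample = nn.PixelShuffle(scale)  # depth-to-space upscaling

    def forward(self, x):
        return self.upsample(self.body(x))

lr = torch.randn(1, 3, 180, 320)  # one low-resolution frame
print(TinySR()(lr).shape)         # torch.Size([1, 3, 720, 1280])
```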
Providing an accurate estimated time of package delivery on the purchase pages of e-commerce platforms is of great importance to users' purchasing decisions and post-purchase experiences. Although this problem shares some common issues with conventional estimated time of arrival (ETA), it is more challenging in the following aspects: 1) inductive inference, as models are required to predict ETA for orders with unseen retailers and addresses; and 2) high-order interaction of order semantic information, since apart from spatio-temporal features, the estimated time also varies greatly with other factors, such as the packaging efficiency of retailers, and with the high-order interactions among these factors. In this paper, we propose an inductive graph transformer (IGT) that leverages raw feature information and structural graph data to estimate package delivery time. Unlike previous graph transformer architectures, IGT adopts a decoupled pipeline and trains the transformer as a regression function that can capture the multiplex information from both raw features and dense embeddings encoded by a graph neural network (GNN). In addition, we further simplify the GNN structure by removing its non-linear activation and learnable linear transformation matrix. The reduced parameter search space and linear information propagation in the simplified GNN enable IGT to be applied in large-scale industrial scenarios. Experiments on real-world logistics datasets show that our proposed model significantly outperforms state-of-the-art methods on delivery time estimation. The source code is available at: https://github.com/enoche/IGT-WSDM23.
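The simplified GNN is easy to state in code: with the non-linear activation and the learnable transformation removed, message passing reduces to repeatedly multiplying the features by the normalized adjacency matrix. A minimal numpy sketch of this linear propagation follows; names are illustrative, not from the released code.

```python
import numpy as np

def propagate(adj, features, k=2):
    """Linear message passing: no activation, no learnable weight matrix.
    Features are smoothed k times through the symmetrically normalized
    adjacency matrix with self-loops: X' = (D^{-1/2} (A + I) D^{-1/2})^k X."""
    a = adj + np.eye(adj.shape[0])             # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(a.sum(axis=1))
    a_norm = a * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    x = features
    for _ in range(k):
        x = a_norm @ x                         # one linear propagation step
    return x                                   # dense embeddings for the transformer
```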
Open-set semi-supervised learning (OSSL) has attracted growing interest, as it studies a more practical setting in which out-of-distribution (OOD) samples are contained only in the unlabeled data. Existing OSSL methods such as OpenMatch learn an OOD detector to identify outliers and typically update all model parameters (i.e., full fine-tuning) to propagate class information from labeled to unlabeled data. Recently, prompt learning has been developed to bridge the gap between pre-training and fine-tuning, showing higher computational efficiency on several downstream tasks. In this paper, we propose a prompt-driven efficient OSSL framework, called OpenPrompt, which propagates class information from labeled to unlabeled data with only a small number of trainable parameters. We propose a prompt-driven joint-space learning mechanism that detects OOD data by maximizing the distribution gap between in-distribution (ID) and OOD samples in the unlabeled data, allowing our method to detect outliers in a new way. Experimental results on three public datasets show that OpenPrompt outperforms state-of-the-art methods with less than 1% of trainable parameters. More importantly, OpenPrompt achieves a 4% improvement in AUROC for OOD detection over a fully supervised model on CIFAR10.
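A schematic sketch of the distribution-gap idea follows; the margin form and all names are assumptions for illustration, not the paper's objective. The intent is only to show a loss that grows when the OOD scores of unlabeled outliers are not separated from those of ID samples.

```python
import torch.nn.functional as F

def distribution_gap_loss(id_scores, ood_scores, margin=1.0):
    """Illustrative surrogate: encourage a margin between the mean OOD
    scores of ID samples and OOD samples in the unlabeled data."""
    gap = ood_scores.mean() - id_scores.mean()
    return F.relu(margin - gap)  # zero once the gap exceeds the margin
```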
Complementary-label learning (CLL) is a common application in weakly supervised scenarios. In real-world datasets, however, CLL encounters class-imbalanced training samples, where the number of samples of one class is significantly lower than that of other classes. Unfortunately, existing CLL methods have not explored the problem of class-imbalanced samples, which degrades prediction accuracy, especially on the imbalanced classes. In this paper, we propose a novel problem setting that allows learning from class-imbalanced complementary-label samples for multi-class classification. To solve this new problem, we propose a new CLL method called weighted complementary-label learning (WCLL). The proposed method models a weighted empirical risk loss by exploiting the class-imbalanced complementary-label information, and it is also applicable to multi-class imbalanced training samples. Moreover, an estimation error bound for the proposed method is derived to provide theoretical guarantees. Finally, we conduct extensive experiments on widely used benchmark datasets to validate the superiority of our method over existing state-of-the-art methods.
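To illustrate what a weighted empirical risk for complementary labels can look like, here is a hedged PyTorch sketch; the surrogate form and weighting scheme are assumptions for exposition, not the exact WCLL loss. A complementary label marks a class the sample does not belong to, so the loss penalizes probability mass on that class, scaled by a per-class weight that compensates for imbalance.

```python
import torch
import torch.nn.functional as F

def weighted_complementary_loss(logits, comp_labels, class_weights):
    """Illustrative weighted complementary-label loss: penalize probability
    assigned to the complementary (i.e., known-wrong) class, with per-class
    weights that up-weight rare classes."""
    probs = F.softmax(logits, dim=1)
    p_comp = probs.gather(1, comp_labels.unsqueeze(1)).squeeze(1)
    w = class_weights[comp_labels]                    # per-class imbalance weight
    return (-w * torch.log(1.0 - p_comp + 1e-8)).mean()
```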
To achieve promising results on blind image super-resolution (SR), some attempts leverage the low-resolution (LR) image to predict the blur kernel and improve SR performance. However, these supervised kernel prediction (SKP) methods are impractical because real-world blur kernels are unavailable. Although some unsupervised degradation prediction (UDP) methods have been proposed to bypass this problem, the inconsistency between the degradation embedding and the SR feature remains challenging. By exploring the correlation between the degradation embedding and the SR feature, we observe that jointly learning content-aware and degradation-aware features is optimal. Based on this observation, a content- and degradation-aware SR network, named CDSR, is proposed. Specifically, CDSR contains three newly established modules: (1) a lightweight patch-based encoder (LPE) is applied to jointly extract content and degradation features; (2) a domain-query attention based module (DQA) is employed to adaptively reduce the inconsistency; and (3) a codebook-based space-compress module (CSC) suppresses redundant information. Extensive experiments on several benchmarks demonstrate that the proposed CDSR outperforms existing UDP models and achieves competitive performance in PSNR and SSIM even compared with state-of-the-art SKP methods.
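As a rough illustration of the domain-query attention idea, the PyTorch sketch below lets a set of learnable domain queries attend over degradation tokens; sizes and names are assumptions, not the CDSR implementation.

```python
import torch
import torch.nn as nn

class DomainQueryAttention(nn.Module):
    """Illustrative cross-attention: learnable domain queries attend over
    degradation embeddings, adaptively reducing their inconsistency with
    the content features before fusion."""
    def __init__(self, dim=64, num_queries=8, heads=4):
        super().__init__()
        self.queries = nn.Parameter(torch.randn(num_queries, dim))
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, degradation_tokens):           # (B, N, dim)
        b = degradation_tokens.size(0)
        q = self.queries.unsqueeze(0).expand(b, -1, -1)
        out, _ = self.attn(q, degradation_tokens, degradation_tokens)
        return out                                    # (B, num_queries, dim)
```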
Federated learning is an emerging technique for training models from decentralized datasets. In many applications, data owners participating in a federated learning system hold not only data but also a set of domain knowledge. Such knowledge, which includes human know-how and craftsmanship, can be extremely helpful for the federated learning task. In this work, we propose a federated learning framework that allows the injection of participants' domain knowledge, where the key idea is to refine the global model with local knowledge. The scenario we consider is motivated by a real industry-level application, and we demonstrate the effectiveness of our approach on this application.
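A minimal sketch of one training round under this idea follows, assuming FedAvg-style weight averaging and a hypothetical per-client refinement hook; train_locally and refine_with_knowledge are illustrative placeholders, not the paper's API.

```python
import copy
import torch

def federated_round(global_model, clients):
    """One illustrative round: each client fine-tunes the global model on its
    own data, applies its domain-knowledge refinement hook, and the server
    averages the resulting weights (FedAvg-style)."""
    states = []
    for client in clients:
        local = copy.deepcopy(global_model)
        client.train_locally(local)          # standard local SGD steps (assumed hook)
        client.refine_with_knowledge(local)  # inject domain knowledge (assumed hook)
        states.append(local.state_dict())
    avg = {k: torch.stack([s[k].float() for s in states]).mean(0)
           for k in states[0]}
    global_model.load_state_dict(avg)
    return global_model
```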
In this technical report, we introduce Effidit (Efficient and Intelligent Editing), a digital writing assistant that facilitates users to write higher-quality text more efficiently by using artificial intelligence (AI) technologies. Previous writing assistants typically provide the function of error checking (to detect and correct spelling and grammatical errors) and limited text-rewriting functionality. With the emergence of large-scale neural language models, some systems support automatically completing a sentence or a paragraph. In Effidit, we significantly expand the capacities of a writing assistant by providing functions in five categories: text completion, error checking, text polishing, keywords to sentences (K2S), and cloud input methods (cloud IME). In the text completion category, Effidit supports generation-based sentence completion, retrieval-based sentence completion, and phrase completion. In contrast, many other writing assistants so far provide only one or two of these three functions. For text polishing, we have three functions: (context-aware) phrase polishing, sentence paraphrasing, and sentence expansion, whereas many other writing assistants often support only one or two functions in this category. The main contents of this report include the major modules of Effidit, the methods for implementing these modules, and the evaluation results of some key methods.
This paper presents a new model architecture, the gate with inhibition MLP (giMLP). The gate with inhibition on CycleMLP (gi-CycleMLP) produces performance on par with the original on the ImageNet classification task, and the architecture also improves BERT, RoBERTa, and DeBERTaV3 models through two novel techniques. The first is the gating MLP, in which a matrix multiplication between the MLP input and the trunk-attention input further tunes the model's adaptation. The second is inhibition, which inhibits or enhances the branch adjustment; as the inhibition level increases, it imposes a stronger restriction on the features. We show that a gi-CycleMLP with a lower inhibition level can be competitive with the original CycleMLP in terms of ImageNet classification accuracy. In addition, a comprehensive empirical study shows that these techniques significantly improve fine-tuning performance on NLU downstream tasks. For fine-tuning DeBERTa with the gate with inhibition (giDeBERTa), we find that it achieves appealing results on most NLU tasks without any additional pretraining. We also find that, with the inhibition gate in use, the activation function should have a short and smooth negative tail, so that unimportant features, or features that hurt the model, can be moderately inhibited. Experiments on ImageNet and twelve downstream language tasks demonstrate the effectiveness of the gate with inhibition, both for image classification and for improving fine-tuning capability on natural language tasks without any extra pretraining.
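Since the abstract describes the mechanism only at a high level, the following PyTorch sketch is a loose interpretation of a gate with inhibition: the gate multiplies the trunk elementwise, and a fixed inhibition level is subtracted from the gate before it is applied. The exact formulation in the paper may differ.

```python
import torch
import torch.nn as nn

class GateWithInhibition(nn.Module):
    """Illustrative gating unit: a projected gate modulates the trunk, and a
    subtracted inhibition level suppresses weak features more strongly as
    inhibition grows (a loose reading of the abstract, not the paper's code)."""
    def __init__(self, dim, inhibition=0.1):
        super().__init__()
        self.gate_proj = nn.Linear(dim, dim)
        self.inhibition = inhibition

    def forward(self, trunk, gate_input):
        gate = torch.sigmoid(self.gate_proj(gate_input)) - self.inhibition
        return trunk * torch.clamp(gate, min=0.0)  # inhibited features are zeroed
```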